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ABSTRACT 

This paper describes a coding system devised to 
analyze conversations of graduate students in applied linguistics at 
Edinburgh University. The system was devised to test the hypothesis 
that as shared knowledge among conversation participants grows, the 
textual density of in-group members has more cues than that of 
strangers. The informal conversations of 6 students who shared 
classes in linguistics were recorded over the course of the 199 1—92 
academic year. The paper provides details of the lexical and 
grammatical tags applied to the texts of the conversations and 
outlines the code for the analysis of each discourse unit's assumed 
knowledge area, macro-function, speech act, and move. The problems 
encountered in the implementation of this system are discussed and 
explained. (Contains 15 references.) (MDM) 
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A CODING SYSTEM FOR ANALYSING A SPOKEN TEXT DATABASE 
Joan Cutting (DAL) 



Abstract 

This paper describes a coding system devised to analyse 
conversations of 1991-92 Applied linguistics MSc students at 
Edinburgh University. It gives details of (he lexical and grammatical 
tags that are applied to the text itself and outlines the code for the 
analysis of each discourse unit's assumed knowledge area, macro- 
function, speech act, and move. The problems encountered in the 
implementation of this system are discussed and explained, and 
solutions are offered. 



1. Introduction 

There have been several studies, in the fields of socio linguistics and 
psvcholinguistics, thai compare ihe language r/ strangers and that ol friends 
(Tannen 1989; Duck 1991). However, it would appear that there is no longitudinal 
linguistic study that follows through the interactions of speakers from the moment 
that they first meet until they become friends or associates, in order to discover 
exactly how their language develops over lime. 

I am making a developmental study of casual conversations of MSc students from 
the 1991-92 Edinburgh University Applied Linguistics course, aiming to find 
exactly how their language changes as thty form a discourse community. The 
analysis is hoped to explain how the in-groups code becomes increasingly 
restricted (Bernstein 1971) and thus inaccessible to an outsider to this MSc group, 
and to provide a model for predicting language changes during discourse 
community formation. It is not an a priori model: the categories have been devised 
as a result of examining the dialogues with an cihnomeihodological eye. 

In order that the pragmatic analysis of the implicit language that cvoKcs in this 
closed network academic group (U*vy 1979; Krcckcl 1981) can be both qualitative 
and quantitative, I have devised a detailed coding system. The system contains 
certain lexical and grammatical features that depend on the context of the MSc 
course for their full meaning: special terms and names, general words ilialliday 
and Hasan 1976) and exophoric reference, substitution and ellipsis. I «;.ill these 
features implicit 'contextualisation cues', to use Gumper/. s term lor linguistic 
features that contribute to the 'signalling of contextual presuppositions' (1082 71). 
I hypothesise that over time, as the group becomes closer knit, united by the 
common experience of the MSc course, there will be an increase in implicit 
contextualisation cues. 
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The coding system also contains functional categories, since the linguistic features 
must be seen in the context of when, how arc! why they are used, so that a 
statement may be made about group dynamics. Thus I analyse each discourse unit 
in terms of perlocutionary function, implicature and speech act. as well as the 
knowledge area assumed. 

This paper describes each category of my coding system. It discusses the 
difficulties involved in making the system work and adapting the categories to meet 
the needs of natural language. 

2. The Overall Methodolo g y 
2.1 Hypotheses 

I hypothesise that as shared knowledge grows, the textual density of implicit 
contextualisation cues increases and that the language of in-group members has 
more cues than that ot strangers. I hypothesise that course- related topics will be 
more impenetrable than non-course-related topics to an outsider to the course; that 
course-related topics will become more frequent than non-course- related topics; 
and that this will cause the conversations to have larger impenetrable sections 
because of both the assumed knowledge area and the density of implicit 
contextualisation cues . 

My hypothesis about the pattern of development of the implicit contextualisation 
cues is that after the beginning of the MSc course, there will be a peak of special 
terms (eg: 'X-bar'). proper names (eg: 'Chomsky'), demonstrative and 
comparative reference (eg: 'this', 'more'), combined with a drop in explicit 
cndophoric noun phrases with post-head dependents teg: 'that we did', 'in class'). 
As the course progresses, special terms and names will level off and there will be 
an increase in third person personals (eg: 'she'), indefinite pronouns (eg: 
anybody'), exophoric substitution (eg: 'the one') and ellipsis, and superordinates 
(eg: 'book'), and general nouns and verbs (eg: 'thingy'.'do'"). This overall trend 
will be affected by events such as portfolio dates and project deadlines, which lead 
to conversations laden with technical terms and proper names. 

I predict that certain informal aspects of the language will increase as members 
become more familiar with each other: sociocentric tillers such as 'you know' and 
I mean', slang and expletives, and sentences with no initial subject or auxiliary 
;mri unfinished sentences. 

I take into account two secondary but essential factors: cohesion and function. A 
consideration of cohesion should reveal that as the language becomes more 
exophoric, general and bald, as explicitness is no longer necessary, the risk of 
breakdowns and requests for clarification increase, especially in course-related 
topics, because speakers wrongly assume that all ,uc relevant common knowledge 
is in their hearer's mind. 

The analysis of the function of utterances containing cues should show that the use 
of implicit contextualisation cues is a generally expected unmarked means of 
claiming in-group membership (Levinson 1978; Tannen 1989). I hypothesise that. 
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in both transactional and interactional social exchanges, there will be an increase in 
the students* manipulation of shared knowledge as they flout quality maxims. 1 
predict, too. an increase in speech acts that are demonstrations of in-the-same- 
boatness*. such as expressions of dissatisfaction with self or situation, or pesitive 
evaluation of the interlocutor. 

2.2 Method of data collection 

I openly made 15 tape-recordings (a lotal of 4 hours 40 minutes) of MSe student 
conversations in the common room of the Applied Linguistics department from 4 
October 1991 until 12 May 1992. I recorded once a week over three periods of time: 
the first half of the first, second and third term. The conversations (2^. in lotal) were 
spontaneous and unguided. and I kept at a distance at the moment of recording so as 
not to be included. Six native speakers of English wlv, had options in common and 
tended to sit together in the common room consistently were, tour weeks into the 
course, selected for analysis, on the assumption that they were representative of all 
native speakers of English on the course. Once I had transcribed the recordings 
(26.000 words). I disregarded dialogues or long sections of dialogues which did not 
contain at least two of the six chosen speakers. 

2.3 Method of lata analysis 

I analysed all c\scourse units of the text (3.500) in terms of knowledge area and 
function, in six fields. I also tagged the text U : ield seven) lexically and grammatically. 

Figure One: Fields and tags of the coding system 



Functional Fields: 

1) Dialogue and discourse unit numbers 

2) Knowledge area 

3) Macro-function 

4) Move and topic 

5) Speech act 

6) Speakers 

TcM Field a nd lin guistic taps 

7) Lexical : verbs 

nouns 
Grammatical. articles . 

pronouns and adjectives 
substitution and ellipsis 
dependents 



The six functional fields were dialogue number and discourse unit number, 
knowledge area, move and topic, speech act. and speakers Of the text tags, the 
lexical ones were mainly verbs and nouns, tagged with numbcts. jn d the 
grammatical ones were articles, pronouns and adjectives, substitution and ellipsis, 
and dependents tagged with letters/symbols. 

Let us now examine the individual categorisation of these functional labels and text 
tags. All examples quoted in this paper are taken from my database. The numbers 
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beside examples quoted here are from Field one: dialogue arid discourse unit. The 
first two digits indicate the dialogue number; the other three indicate discourse unit 
number. The letters are from Field six: the speakers. AM, BM, CM, DM were the 
men selected for study in this research, and AF, BF the women. 



3. The Individual Categorisation 

3. 1 The discourse unit labels 

3.1.1 Field one: dialogue and discourse unit. 

My criteria for deciding what constituted a discourse unit were as follows: that a 
unit can end where a grammatical sentence ends, regardless of whether this is the 
end of a turn or move: that a unit always ends where there is falling intonation 
combined with a pause longer than 0.5 seconds, even though it happens in the 
middle of a sentence; and that each discourse unit constitutes a speech act. Any 
utterance occurring simultaneously, unless it is unintelligible, is considered a unit. 

3.1.2 Field two: knowledge area 

I divided assumed common knowledge into four areas, in order to distinguish non- 
course- related topics (areas 1, 2 and 3) from course-related topics (area 4). I 
added the symbol P if I felt there was shared, privileged or interpersonal 
knowledge in any of the four knowledge areas. 

l igure Two. Knowledge areas 



1 The world. Edinburgh 

2 Linguistics, language teaching and learning 

3 Edinburgh University. Department of Applied Linguistics, 
Institute for Applied Language Studies. 

4 The 91-92 MSc course in Applied Linguistics of Edinburgh 
University 

(P Privileged or shared knowledge in any of the above four areas) 



When 1 was assigning discourse units to the four knowledge areas. I began to feel 
that I only needed two categories: course- related and non-course-related. However, 
since some topics were more course-related than others and some were course- 
related topics but also related to the 'real world" outside. I maintained the original 
four areas. Because it was hard to say where one knowledge area ended and 
another began, at the discourse unit level, a knowledge area was established for the 
duration of a whole topic, wherever possible. 

3.1.3 Field three: macro-function 

Discourse units were next labelled T for the transactional function or S for the 
social, the interactional. I did not predict an increase or decrease in either over 
time; I wanted to examine the grammatical and lexical tags within the two 




functions. Again, the overall function was established for a whole topic, rather 
than on an individual discourse unit level. 

Figure Three: Macro-functions 



T Transactional, instrumental 
S Interactional, social 

Flouting of quality and clarity maxini> in either of the above two) 



T was the purely transactional functional category, the instrumental such as. 

e.g -01054 BM 'Could you gel me a tuna and sweet corn one please 0 ' 
01055 AM 'Me as well.' 

or learning specifically how to do something, iransmil'ing information needed for 
an immediate task such as: 

e.g. -0601 1 CM Tnc best thing to do is use a die- master card or something.' 
06012 NF 'I don't have a master card ' 

All other units were tagged with S for the social expressive exchange, the 
interactional category, and the phatic. In many cases, the speakers were testing the 
normality of their situation or feelings- 

e.g. - 15020 CM 'But I suppose you're moderately efficient. < 1 )' 

15021 CM 'But I'm not near an outline.' 

15022 DM 'Oh God no I'm on my reading.' 

In others, they were reassuring themselves that they had the same intormalion and 
checking each other's attitude to it: 

c g. -08063 CM 'Five's a bad mark right'*' 

08064 AM 'I think we gel five we're line.' 

08065 CM It's the other mark I want.' 

There is a dine from ihe instrumental to the purely phatic. liven the purely phatic 
could have a tinge of information exchange. Sometimes an interaction that, on the 
surface, was an information exchange could be fundamentally a social exchange. 

In the same field. 1 added a label V to indicate thai the speaker was playing with 
truth and/or the language, flouting cooperative maxims by exaggeration, irony, 
himcr; playing with cliches in mock seriousness, or mixing registers to amuse etc. 
1 wanted to test whether humorous exchange ere a marker ot in-groupness that 
increased over lime: 

c g -04080 BM They're sort of we give change!' 

04081 DM 'If you can get on the bus you get change then ' 

04082 BF 'Ahrighi // that's good.' 

04083 DM 7/ Yeah.' 

04084 AF 'If you don't mind breaking your leg as you try and gel on. 
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3.1.4 Field four: topics and moves 

I was especially interested in examining topic shifts to see how often implicit 
conlextualisaiion cues occurred in the first discourse unit of a new topic. I studied 
patterns of moves, to see whether this revealed something about group dynamics. 

Rgure Hour: Topics and moves 



T Initiate 

R Respond 

1 1st. discourse unit of/within a turn, on a new topic 

2 Subsequent discourse units within a turn, on an established :opic 

3 1st discourse unit of a turn, on an established topic 



Topic' I defined as part of an interaction that can be given a title in the form of an 
indirect question that covers all speakers' contributions to it. Thus an example uf 
this discourse topic would be 'Why X is not going to do any reading' or 'What Y 
did on his mountaineering weekend.' To analyse topics. 1 lagged the first discourse 
unit of a turn or within a turn on a new topic: T; I lagged subsequent discourse 
units within a turn on same topic, even if still answering ihe same question: '2': 
and the first discourse unit of a turn on existing topic: '3'. When the topic drift 
was so gentle that it was almost imperceptible, I placed topic boundaries as close u> 
the centre of the shift as possible. 

I classed the moves T for Initiate and R for Respond. R was either the second half 
of an adjacency pair or simply a minimal response to the preceding discourse, such 
as a backchannel or prompt to continue. T was everything else. T could occur in 
IT, T2 and T3. R tended to occur mainly in R3. as in the first discourse unit of an 
answer or adjacency pair response R2 occurred rarely because subsequent 
discourse units of an answer were not normally specifically required by the 
question. 

e.g - 1 1 105 NM 'You went home?' 

11 106 AM 'I went home round about half past jeven no. ' 

11107 AM 'I didn't go.' 

1 1 IC6 was. strictly speaking, all that was needed lo answer NM's question 1 1 107 
was T2 rather than K2. Finally. Rl was the change of focus to the metalinguistic, 
the request for clarification, or the answer to a question from a previous topic, 
ignoring an interspersed topic. 

3.1.5 Field five: speech act 

The task of classing discourse units in speech acts was particularly daunting 
because speech acts are so elusive and one unit often fits into more than one speech 
act category When a unit did not fit neatly into an act category. 1 created a new 
category, or expanded an existing category, and noted this realisation of the act in 
order to guarantee consistency. 
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I initially detected 40 speech acts, but having failed to make them water-tight and 
mutually exclusive, I opted for grouping them into nine macro speech acts with no 
subdivisions. Discourse units were analysed in terms of attitude (neutral, positive 
and negative) and then in terms of person or object discussed (sell, interloper or 
communication, and third party or situation). This solution obviously did not 
produce a delicate tool but I felt that it would be more manageable for the 
quantitative analysis stage. 



Figure Five: Speech acts 



attitude towards: 


Neutral 


Positive 


Negative 


Self 


1 1 I do/am 


1 2 I do/am good 


13 1 do'am bad 


Interlocutor/ 
communication 


21 You do 'arc 
Here we are 


22 You're good/ 
I'm with you 


23 You're bad ' 
I'm not with 
you 


Third parly/ 
situation 


31 lt/she does- is 


32 U'shc does is 
good 


33 It she does is 
bad 



This paper permits no more than a general view of each macro speech act As tar 
as Attitude towards Self was concerned, within act 11,1 included inform, explain, 
answer and express own intentions and desires: within act 12. express satislaction 
with self, and reassure and console self; within act 13. express dissatislaction with 
self, minimise praise of seit. excuse self and apologise. 

Similarly, for Attitude towards Interlocutor or Communication, there was in act 
21. greet! fill phatically with afterthoughts, backchamiclling. laughter etc., request, 
in act 22 evaluate interlocutor positively, console, encourage, sympathise, agree, 
predict drift, advise and suggest, offer/promise goods or action: and in act 23. 
challenge factual content/truth value, decline, and deny. 

Finally in Attitude towards Third Party or Situation. 1 included in act M. inform, 
explain, ask/answer; in act 32, express satisfaction v>ith third party or situation; 
and in act 33, express dissatisfaction with them, express fear and apprehension. 
And thus I hoped to cover everything that MSc students do with language m the 
common room 

3.2 Field seven: the tagged text 
3.2.1 Lexical tags 

Although the main emphasis of my study is the noun phrase. I included an analysis 
of all verbs, because I wanted to show that over time there would he an increase in 
general verbs 'do* and 'make', in terms of a percentage out of all verbs 

eg. 10069 CM 'I've done all the people' 

I had one category for the course-related general verb and one tor the non course- 
related. 



The other category of verb that I tagged separately from the 'all verbs' category 
was the filler: both sociocentric sequences (Stubbs 1983) such as 'I mean', 'You 
see*, and omissible hedging verbs with or without sentential objects (Brown and 
Miller 1980) such as 'I suppose.' 

c g. - 17020 AF 'It is a lot you know three thousand words'. 



Figure Six: Lexical tags verbs 



10 


All Verbs (excluding be, tiller, general verbs) 


eg 


'bought' 


11 


Be 


eg 


'is' 


12 


Filler 


eg 


'1 mean' 


13 


Non-course general verb 


eg 


'do' 


14 


Course general verb 


eg 


'do' 



In my categorisation of nouns, 1 tagged common nouns, proper nouns and general 
nouns, indicating whether the referent was course-related or non-course-related. 



I tagged expletives and slang, being interested in these as markers of intimacy that 
might increase in number with repeated interaction. I included certain adjectives: 

e.g. -08020 AM ' Bloody tosh, isn't it?' 

08021 CM 'Well it's- it's a bit abstract. ((1))' 

within the group of expletives, and phrasal expressions: 

e.g. - 10029 DM 'So what on earth is it going to be about?' 

uithin the slang group. 

Figure Seven: lexical tags - nouns 



1 1 



Fxpletives 
Slang 



eg:'hell' 
eg:'scivers' 



1 1 



20-22 

20 

21 



Non-course nouns 
Common non-course nouns 
Proper non-course names 
General non-course nouns 
Course nouns - special 



eg: 'budgie 
eg:'Japan' 
eg: 'thing* 



30-50 
30-32 



Common special course nouns 

technical 

unique 

superordinate 

Proper special course names 
actual use 



30 
31 
32 



eg:'X-bar' 
eg: 'portfolio' 
eg:'(syntaxl book' 



40^1 



40 
41 
50 
60 



Course nouas - rourse- by context 



metonymical use 
General special course nouns 



eg:'Chomsky' 

eg: 'Chomsky (studyl* 

eg: 'thing' 

eg: 'work (for the exam]' 
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Categorising nouns with non-course-related referents was quite simple. Words such 
as 'budgie* and 'bus-stop' were classed as common nouns, 'Bush' and 
'Manchester' as proper nouns, and 'thing' and 'people' as general nouns. 

Classing nouns with course- related referents was more complex. These were 
divided into two main groups: special course nouns and course-b> -context nouns. 
The special course nouns, intrinsically course- related, were classed under the 
headings common, proper and general nouns. 

Under the first special course nouns heading, that of common nouns. 1 made three 
sub-divisions - technical, unique and superordinate. I tagged nouns technical if they 
were intrinsically specialised terms independent of the context of the course, 
linguistics and language teaching theory terminology such as 'X-bar', 'diglossia' 
and lexical syllabus". I tagged nouns unique common special course nouns ifihcy 
were single course components such as 'portfolio', 'core project', and 'the 
examiner", used in department organisation, the course handbook etc. 
SuperorJmaie common special course nouns constituted a class of mans possible 
course components. Super ordi nates were count nouns such as I syntax] book'. 
'[S \| class' and 'Itutorial] task' whose precise meaning was not clear since they 
were the second noun of a two-word phrasal expression (Huddleston 1^88:103) 
whose first word (usually a nominal pre-head modifier) was omitted. In this 

e.g. -20020 CM 'SLA class next week and tfie^papcrs due in next f : nda\ " . 
the paper was the core project, the first project paper of the course. 

Under the proper special course noun heading there were two suMiMMonv one 
for proper nouns with their actual intended use, such as 'Chomsky'. HUT and 
'Structuralism', and the other for their metonymical use. in which the noun 
referred elliptically to something other than the course/linguist named. As with the 
superordinate common special nouns, these nouns could be seen as one word of a 
two-word phrasal expression, except that in the case of the metonymical use, it is 
the second word that is omitted. In 

e.g. -08031 AM Though though 1 haven't 1 haven't done am Ciiojlbio ' 
"Chomsky " meant Chomsky study or revision for the forthcoming examination 

The gejieial special course noun category was structurally the same as the general 
non -course noun. To take an example: 

e.g. - 10069 CM No I've done all the pcojjk-" 
Here, 'people' meant theorists or linguists. 

The second main group of course nouns 1 called course by-context noui.s These 
were not inti msically course-related but became course-related by their use in 
course-related topics. The referents of 'discussion', 'work' and 'this week" were 
course-related, for instance, when the 'discussion' was one that the students had 
had in a tutorial, 'work* meant work for the examination, or 'this week' meant this 
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week in classes. Here whal seemed to he missing was a post-head dependent such 
as a relative clause, a prepositional phrase or a content clause. 

3.2,2 Grammatical tags 

Grammatical lags were put on articles, pronouns and adjectives, substitution and 
ellipsis, and dependents. 



f-'igure Eight: Grammatical lags ~ articles, pronouns, adjectives, substitution and 
ellipsis 



Articles 




T/A indefinite article / definite article 




Pronouns and adjectives 




P Indefinite pronoun 


eg:'anybody' 


E Existential pronoun 


eg:'she' 


S Possessive adjective 


eg:'lheir' 


D Demonstrative pronoun / adjective 


eg:'lhis' 


C Comparative pronoun / adjective 


eg:'more' 



I lagged all articles, predicting that there would be an increase in ihe partially 
explicit exophoric use definite noun phrase, especially with course-relaled nouns 
(including general course-nouns), as in this example: 



e.g. 16032 AE 'I think- I find a real loss actually of not having read liifi (0.5) 
Eay Culler article which seems to be underpinning this = ' 

I tagged the indefinite pronouns "everything", 'nobody' etc., feeling thai they have 
an exophoric quality They constituted a 'vague' feature of in-group language. In 
the following example: 

e.g. -25020 NE 'Is there anybody else applying?' 

the full extent of 'anybody else' could only be understood by group members. 

Ihe lagging of pronouns, adjectives, substitution and ellipsis proved to be a 
complex task There were different degrees of endophora and exophora. I 
considered pronouns and adjectives endophoric if the referent was in the text, 
whelhe it was an item or a proposition, even though the reference would have 
been unclear or ambiguous to our hypothetical outsider because of the existence of 
a number of possible textual referents. I decided to tag as semi-implicit 
endophoric. the adjectives that modified general nouns and the pronouns and 
substitutes that had endophoric cohesion to general nouns, exophoric pronouns or 
substitutes, or indefinite pronouns. On the borderline between endophora and 
exophora seemed to be associative anaphora in which the referent could be 
understood from the schemata established by other referents in the text. 

The rest of the pronouns, adjectives, substitutes and ellipses were exophoric or 
miii-anaphoric. Here, the degrees of implicitness were more noticeable. Again I 
differentiated between adjectives that modified common nouns explicitly, and those 
that could be called semi-implicit exophoric because they modified general nouns 
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or exophoric substitutes. Most demonstratives, for example, arc exophoric, but 
demonstrative adjectives seemed more implicitly exophoric when they modified the 
impenetrable, intertextual general nouns or exophoric substitutes. 

e.g. -15045 DM 'So 1 typed that thing up again after you'd gone." 

The pronouns, substitutes and ellipses whose referents were irrecoverably right 
outside the text were classed as totally implicit exophoric. 

Tagging pronoun and adjective categories revealed interesting particularities of 
each. Demonstratives seemed more obviously intertextual than other forms of 
reference. In the following example, the pi/./.a 1 outing had evidently been 
discussed on a former occasion, yd it was exophoric as far as the text was 
concerned 

e g -110X0 AM Did you go to this, piz/a on Friday T 

Some of the characteristics of demonstratives were typical of spoken language. I 
tagged the demonstrative pronoun in the expression like that" in an unfinished 
sentence not as a demonstrative but as a substitute expression, or 'vague 
expression* (Channcll 1085). Familiarity with the presuppositional pool is needed 
to complete the sentence 

e.g. -04099 BF 'And King's Building's got a bar and siulliikejliat ' 

Narrative sections featured the informal demonstrative adjective (his 1 , which 
brings the story closer to the hearers: 

e.g. -02084 MM 'So so he keeps he's got this really incredible pari." 

Comparative reference has three points of reference: the referent mentioned, the 
referent that it is compared to. and the quality that unites or di\ides them 1 was 
tempted to tag some of the cases in my data exophoric. because of the e\ophoric 
comparing quality: 

e.g. ■ 1507o DM The actual thing's exactly ltiCi_aiiillC - * 

I opted not to call these cases exophoric, however, since the 'exophoric ncss' of a 
quality was too flimsy to pin down. 

Two final points need U be made in connection with tagging pronouns and 
adjectives in spontaneous ccordings of groups of three or more speakers The fust 
is that since few recordings contained beginnings of conversations, most 
conversations were captured mid-topic. Thus there were reference items that 
looked as if they might have been endophoric. with the referent probably in the 
preceding, unrecorded, stretch of conversation Faced with this uncertainty. I 
tagged such cases using guesswork based on the text. 

The second point is that some speakers missed the textual referents because they 
joined conversations already under way. Thus, although 1 might tag one item of 
reference as endophoric. it could seem exophoric to some hearers. Witness: 
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e.g. -11064 BP And they were doing a profile of him ' 

1 1065 BP 'And I was son of = ' 

11066 NM 'ffliQ was it? 1 

1 1067 BP 'He. was a guy who wrote this dirty book ' 

11068 BP 'Melvyn Bragg? ' 

Here, BP had to supply the missing restrictive post-head dependent and proper 
name, to repair the breakdown. 

Fagging substitution and ellipsis, I found that exophoric substitution was 
predictably rare, apart from the vague substitute expression at the end of the 
unfinished sentence, and that ellipsis was more interesting to tag as there were 
more instances of implicit use. Here is an example of semi-implicit endophoric the 

them' in line 25014 is exophoric 

e.g. - 25014 CM So what are em how many of them are there do vou know'' 

25015 BP 'I don't know. (0.5)' 

25016 BP" 'Don't know/ 

25017 BP 'Em/ 

25018 Br | think (1) three or two I'm not sure." 

I tagged ellipsis of initial subject or auxiliary, interested in it as an example of 
speakers' inexphcitness about the "given" part of the information structure. This 
informal style could be a marker of intimacy 

e.g. -24040 DM ' Something wrong isn't there somewhere?' 

The ultimate ellipsis is the unfinished sentence: the speaker assumes that the hearer 
can supply the missing information. 

e g. -26131 BM '-<' Have you got this on the the drive at the-?' 
26132 CM No if no you can t do that.' 

I call an unfinished sentence one that is deliberately left in the air', and not one 
that is unfinished because another speaker interrupts. 

Hie last grammatical elements, dependents, were fairly straightforward to tag. I 
predicted that as assumed common knowledge increased, there would be less need 
to identify the nouns with a description, whether in the form of a pre-head 
dependent (determiner or modifier) or a post-head dependent (complement, 
modifier and peripheral). Post-head dependents will abound in the first half of the 
course: 

eg: 16088 AF 1 mean it all to do depends on the point of view of the 
reseaich which we're not really 11 adequate to jn^ gfi ' 

whereas, by the end. the hearer will probably have to supply them: 

eg: 27128 DM 'If you're answering questions anyway so (3) shouldn't be a 
problem. ((2))' 



12 

14 



Figure Nine: Grammatical tags - dependents 



Pre-head determiner, modifier 

Post-head complement, modifier, peripheral 



4. Conclusion 

4.1 Summary and comments 

This paper has examined a coding system for analysing the recorded casual 
conversations collected in the 1991-92 Applied Linguistics MSc common room. It 
has explained the system of six functional fields to analyse the speech acts and 
knowledge areas of each discourse unit, and one actual text field with lexical and 
grammatical tags. The problems of each have been detailed and the solutions 
described. 

Although I first constructed the coding system using the information that I had 
gained from an impressionistic global survey of the data, once I was actually using 
the code to analyse the dialogues, I had to adjust it constantly in response to the 
dictates of the data. On finishing the coding. I had to adapt and re-shape the system 
again to make all the categories fit together into a coherent model. Finally. I have 
confirmed the workability of the code, using intercoder reliability tests 

4.2 The next stage in the research 

Using the Hxcel database, the next stage is to count the implicit contevtualisation 
cues and other linguistic items, as well as the discourse units in the function fields 
of knowledge area, macro-function, topic shifts and moves, and speech acts, in 
each dialogue; and to find the average percentage of these for each of the three 
recording periods of the course, and thus observe whether there are any obvious 
changes over time: whether knowledge area four topics do increase: whether the 
number of cues does increase; which cues increase at which point: whether 
language becomes more informal, and so on. The second stage is m carry out a 
qualitative analysis of chunks of dialogue or even whole dialogues to find 
characteristics of the ways that cues relate to each other within a text 

The third stage is to put all the information together, to observe linguistic items in 
conjunction with the various functional fields and discover whethei their density 
varies according to the knowledge areu. the macro- function and the speech act. 
This would test the hypotheses that couise- related topics have more implicit, 
impenetrable language; and that the use of cues is related to the claiming of in- 
group membership. Impenetrability itself will be tested through questionnaires 
given to speakers from each of the four knowledge areas. 

The subsequent statistical analysis should help me to select the most significant 
implicit contextualisation cues to include in the final model which should suit any 
body of data taken from conversations between in-group members of an academic 
discourse community. I hope eventually to make the model gencralisable to 
English speaking academic discourse communities in Britain, and possibly non- 
13 15 
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academic discourse communities to explain why their conversations can sometimes 
be impenetrable to outsiders. 
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